Contrast Subgraph Mining from Coherent Cores
نویسندگان
چکیده
Graph paern mining methods can extract informative and useful paerns from large-scale graphs and capture underlying principles through the overwhelmed information. Contrast analysis serves as a keystone in various elds and has demonstrated its eectiveness in mining valuable information. However, it has been long overlooked in graph paern mining. erefore, in this paper, we introduce the concept of contrast subgraph, that is, a subset of nodes that have signicantly dierent edges or edge weights in two given graphs of the same node set. emajor challenge comes from the gap between the contrast and the informativeness. Because of the widely existing noise edges in real-world graphs, the contrast may lead to subgraphs of pure noise. To avoid such meaningless subgraphs, we leverage the similarity as the cornerstone of the contrast. Specically, we rst identify a coherent core, which is a small subset of nodes with similar edge structures in the two graphs, and then induce contrast subgraphs from the coherent cores. Moreover, we design a general family of coherence and contrast metrics and derive a polynomial-time algorithm to eciently extract contrast subgraphs. Extensive experiments verify the necessity of introducing coherent cores as well as the eectiveness and eciency of our algorithm. Real-world applications demonstrate the tremendous potentials of contrast subgraph mining. ACM Reference format: Jingbo Shang1, Xiyao Shi1, Meng Jiang2, Liyuan Liu1, Timothy Hanray3, Jiawei Han1. 2018. Contrast Subgraph Mining from Coherent Cores. In Proceedings of ACM SIGKDD Conference on Knowledge Discovery and Data Mining, London, UK, August 2018 (KDD’18), 9 pages. DOI: 10.475/123 4
منابع مشابه
Accurate Classification of Protein Structural Families Using Coherent Subgraph Analysis
Protein structural annotation and classification is an important problem in bioinformatics. We report on the development of an efficient subgraph mining technique and its application to finding characteristic substructural patterns within protein structural families. In our method, protein structures are represented by graphs where the nodes are residues and the edges connect residues found wit...
متن کاملParallel Subgraph Mining on Hybrid Platforms: Hpc Systems, Multi-cores and Gpus
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . x
متن کاملMining Density Contrast Subgraphs
Dense subgraph discovery is a key primitive in many graph mining applications, such as detecting communities in social networks and mining gene correlation from biological data. Most studies on dense subgraph mining only deal with one graph. However, in many applications, we have more than one graph describing relations among a same group of entities. In this paper, given two graphs sharing the...
متن کاملA Scalable Parallel Approach for Subgraph Census Computation
Counting the occurrences of small subgraphs in large networks is a fundamental graph mining metric with several possible applications. Computing frequencies of those subgraphs is also known as the subgraph census problem, which is a computationally hard task. In this paper we provide a parallel multicore algorithm for this purpose. At its core we use FaSE, an efficient network-centric sequentia...
متن کاملCohesive Subgraph Mining on Attributed Graph
Finding cohesive subgraphs is a fundamental graph problem with a wide spectrum of applications. In this paper, we investigate this problem in the context of attributed graph, where each vertex is associated with content (e.g., geo-locations, tags and keywords). To properly capture the cohesiveness of the vertices in a subgraph from both graph structure and vertices attribute perspectives, we ad...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- CoRR
دوره abs/1802.06189 شماره
صفحات -
تاریخ انتشار 2018